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Abstract 

We construct different almost Poisson brackets for nonholonomic systems than those existing 
in the literature and study their reduction. Such brackets are built by considering non-canonical 
two-forms on the cotangent bundle of configuration space and then carrying out a projection onto 
the constraint space that encodes the Lagrange-D'Alembert principle. We justify the need for this 
type of brackets by working out the reduction of the celebrated Chaplygin sphere rolling problem. 
Our construction provides a geometric explanation of the Hamiltonization of the problem given by 
A. V. Borisov and I. S. Mamaev. 

1 Introduction and Outline 

The equations of motion for nonholonomic systems are not Hamiltonian. They can be formulated 
with respect to an almost nonholonomic Poisson bracket of functions that fails to satisfy the Jacobi 
identity. This formulation has its origins in [3Ull25|,[8]. and others. Roughly speaking, one constructs the 
bracket by projecting the canonical Poisson tensor on the cotangent bundle of configuration space onto 
the constraint space using the Lagrange-D'Alembert principle. This formulation is of interest because 
the reduced equations of motion of some important examples have been written in Hamiltonian form 
with respect a usual Poisson bracket (sometimes after a time rescaling), [H El [TJ [131 [T7] . I n this case 
we say that the problem has been Hamiltonized. An important example of Hamiltonization, although 
independent of bracket formulations is given in [15] . 

In this paper we construct more general almost Poisson structures and study their reduction. These 
brackets are obtained by projecting non-canonical bi-vector fields on the cotangent bundle of configu- 
ration space onto the constraint space using the Lagrange-D Alembert principle. We call these brackets 
Affine Almost Poisson Brackets since they are derived from a non-degenerate two-form that deviates 
from the canonical one by the addition of an affine term of magnetic type that annihilates the free 
Hamiltonian vector field. The idea of adding an affine term that "does not see the flow" already appears 
in [L2]. 
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Our motivation for considering this type of brackets was to link the general construction of almost 
Poisson brackets, [301 [25l [8], with the Hamiltonian formulation of the equations of motion for the 
celebrated Chaplygin sphere problem given in [4, 7J. Although the authors in [12] consider affine 
symplectic structures, the Hamiltonization of the problem could not be verified by their methods. 

Perhaps the most important result in the context of Hamiltonization is Chaplygin's reducing mul- 
tiplier theory [TO] that applies to nonholonomic systems with two degrees of freedom that posses an 
invariant measure. This theory does not apply directly to the Chaplygin sphere problem since the 
reduced space is 5-dimensional. However, associated to the symmetry there is a conserved quantity, 
the vertical angular momentum. This integral is used by the authors in [7J to apply Routh's method 
of reduction to the reduced equations and then apply a generalized version of Chaplygin's theory to 
Hamiltonize the problem. 

We give an alternative method to achieve the Hamiltonization of the Chaplygin sphere by carrying 
out the reduction of an affine almost Poisson bracket. The affine term in the bracket needs to be included 
for the conserved quantity to become a Casimir function of the corresponding reduced bracket. In fact, 
we also show that the reduction of the standard nonholonomic bracket defined in [301 [25] does not even 
yield a foliation of the reduced space by even dimensional leaves, having thus very different properties 
from a usual Poisson bracket. This shows that one should extend the notion of nonholonomic almost 
Poisson brackets as introduced in [30\ I25j. and generally considered in the literature, and incorporate 
the affine description if one is interested in obtaining reduced brackets with optimal properties. 

The outline of the paper is as follows. After reviewing the construction of almost Poisson brackets 
in section [21 we present a basic scheme for their reduction in section [3l In section [5] we introduce the 
notion of affine almost Poisson brackets and discuss their reduction in section [5j Section [6] treats the 
Chaplygin sphere problem. We use Cartan's moving frames for SO (3) to avoid messy calculations in 
Euler angles or other local coordinates. Finally, we give some closing remarks in section [71 



2 Background 
Nonholonomic Systems 

A nonholonomic system consists of a configuration space Q with local coordinates q a , a = 1, . . . , re, 
a hyper-regular Lagrangian C : TQ — > R, and a non-integrable distribution T> C TQ that describes 
the kinematic nonholonomic constraints. In coordinates the distribution is defined by the independent 
equations^ 

4(?)?o = 0, » = l,...,fc<n, (2.1) 

where the functions e\{q) are the components of the independent constraint one-forms on Q, e l := 
<£(?) dq a - 

The dynamics of the system are governed by the Lagrange-D'Alembert principle. This principle 
states that the forces of constraint annihilate any virtual displacement so they perform no work during 

1 Here and in what follows the Einstein convention of sum over repeated indices holds and we assume smoothness of 
all quantities. 
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the motion. The equations of motion take the form 

d { dC\ dC 



Xiel a = l,...,n. 2.2 

dt \dq a J dq a 

The scalar functions Aj, i = 1, . . . , k, are referred to as Lagrange multipliers, that under the assumption 
of hyper-regularity of the Lagrangian, are uniquely determined by the condition that the constraints 
([2,ip are satisfied. 

The equations (|2.2|) together with the constraints (|2.1|) define a vector field Yj? on T> whose integral 
curves describe the motion of the nonholonomic system. A short calculation shows that along the flow 
of Y^, the energy function := J£ q a — £, is conserved. 

We now write the equations of motion as first order equations on the cotangent bundle T*Q. Via 
the Legendre transform, Leg : TQ — > T*Q, we define canonical coordinates {q a ,p a ) on T*Q by the 
rule Leg : (q a ,Qa) l— ¥ (la, Pa = dC/dq a ). The Legendre transform is a global diffeomorphism by our 
assumption that C is hyper-regular. 

The Hamiltonian function, TL : T*Q — > R, is defined in the usual way H := E c o Leg' 1 . The 
equations of motion (|2.2[) are shown to be equivalent to 

5a = Pa = ~-K HAie a (g), a = l,...,n, (2.3) 

and the constraint equations (|2.ip become 

ei(q)^=0, i = l,...,k. (2.4) 

The above equations define the constraint submanifold M = Leg(P) C T*Q. Since the Legendre 
transform is linear on the fibers, M. is a vector sub-bundle of T*Q that for each q G Q specifies an 
n — k vector subspace of T*Q. 

Equations (|2.3[) together with (I2.4D define the vector field on A4, that describes the motion of 
our nonholonomic system in the Hamiltonian side and is the push forward of the vector field YV by 
the Legendre transform. The equations (I2.3P can be intrinsically written as 



i y x t*n Q = L*(dH + \ t T*e l ), (2.5) 

where Qq is the canonical symplectic form on T*Q, i : A4 T*Q is the inclusion and r : T*Q — > Q is 
the canonical projection. The constraints (I2.4p and their derivatives are intrinsically written as 

G C := n ^, (2.6) 

where J- is the distribution on T*Q defined as J- := {v G T{T*Q) : {r*e l ,v) = 0}. Here C is a non- 
integrable distribution on that is denoted by H in [2] and in [23]. We reserve these symbols for 
other objects. 

The following theorem was first stated in [32J and its proof can be found in [2] . 
Theorem 2.1. The point-wise restriction of l*VLq to C, denoted Vtc, is non- degenerate. 

Equivalently we can say that along M. we have the decomposition of T(T*Q) as Tm {T*Q) = C©C 0< 3 , 
where is the symplectic orthogonal complement of C with respect to fin. 
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The Almost Hamiltonian Approach 

Although energy is conserved, due to the nonholonomic constraints, the equations of motion (|2,2[) 
cannot be cast in Hamiltonian form. One can however write them with respect to a bracket of functions 
that fails to satisfy the Jacobi identity, a so-called almost Poisson bracket, see for example \30\ [25j [8] . 
There is also an almost symplectic counterpart. After factorization of external symmetries of a so-called 
G-Chaplygin system, the equations of motion can be written with respect to an almost symplectic form 
which is a non-degenerate two-form that is not closed, see [2l l23l |2"2] . 

Roughly speaking, the process of writing the equations of motion for a non-holonomic system in 
an almost Hamiltonian way amounts to eliminating the Lagrange multipliers from the equations of 
motion, and encoding the forces of constraint in a bracket of functions or a bilinear two-form. Once 
this is accomplished the constraints are satisfied automatically. The non-integrability of the constraint 
distribution is then reflected in the failure of the bracket to satisfy the Jacobi identity or in the failure 
of the bilinear two-form to be closed. 

The Standard Almost Symplectic Formulation 

In [2] the Lagrange multipliers are eliminated from the equations of motion (12. 5|) using theorem 12.11 
Since € C and the constraint forms T*e l vanish along C, then the vector field X££ is uniquely 
determined by the equation 

i Y M^C = dH c , (2.7) 

^nh 

where dUc denotes the (point-wise) restriction of dTL to C. The latter equation really resembles the 
structure of a Hamiltonian system except that Sic is n °t a two- form on A4 . 

The Standard Almost Poisson Formulation 

We give a definition of the Almost Poisson Formulation for nonholonomic systems that is convenient 
for our purposes. This formulation follows the general framework of Dirac brackets for systems with 
constraints and appears in the context of nonholonomic systems in [p5]. In [8 J the authors show that 
the bracket so obtained coincides with that of [301 125j . It is shown in [30] that this bracket is bilinear, 
anti-symmetric and satisfies Leibniz rule, but it satisfies the Jacobi identity if and only if the constraints 
are holonomic. Thus the name almost Poisson. 

Let V : Tm{T*Q) — > C be the projector associated to the decomposition T_m(T*Q) = C © C o. 

Proposition 2.2. Let f € C°°(A4) and let f £ C°°(T*Q) be an arbitrary smooth extension of f . Let 
Xj be the free Hamiltonian vector field defined by ixjSIq = df. Let X^ denote the unique vector field 
on A4 with values in C defined by the equation 

i x c tt c = (df)c, (2.8) 

where Sic an d (df)c denote respectively the point-wise restriction of Qq and df to C. Then, along A4, 
we have X c f = VXi. 
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Proof. Let m G A4. Applying V m to both sides of iXj(m){^Q)m = df(m) and pairing with an arbitrary 
v m G T m (T*Q) gives, 

C^m i X / (m)(^Q)m , «m> = C^m d/M , V m ). 

Since is associated with a symplectic decomposition, we have 

(Pm i X / (m)(^Q)m , %) = (^X f (m)(^Q)m , V m Vm) = (^Q)m(Xf(m) , V m %) 

= (HQlm^m I/(m) , %) = (Vp m X f (m){^Q)m , V m ) . 

It therefore follows that 

h m X f {m)^Q)m = V* m df(m). (2.9) 

By definition of V^, we have df(m) = (df(m))c m . Since / is an extension of /, the restriction 
of df to TM. agrees with df. In particular, the same is true about restriction to C m C T m Ad. Therefore 
{df(m))c m = (df(m)) Cm . Moreover, since V m Xj{m) G C m we can replace (flg) m by {VL c ) m in §2M to 
get 

h m x f (m){^c)m = (df(m)) Cm . 
By non-degeneracy of (0c) m we get V m Xj{m) = Xf(m) as required. □ 

In view of (12. 7p and the above proposition, it follows that the nonholonomic vector field X^ satisfies 
■^nh = PX-h, where X-h is the free Hamiltonian vector field defined by ix h Qq = dTi, and the equality 
makes sense on A4. 

Let / G C°°(M) and let / G C°°(T*Q) be an arbitrary smooth extension of /. For m G M. we 
have: 

X^(f)(m) = (df(m),V m X H (m)) = (n Q ) m (X f -(m),V m X n (m)) 

= (n Q ) m {V m X f -(m),V m X n {m)), (2.10) 

where the last identity follows from the fact that the projector V m is associated to the symplectic 
decomposition T m (T*Q) = C m © C m Q . Inspired by this calculation we define the following bracket of 
functions /i, j% G C°°{M): 

{fi,f 2 }M(m) := (n Q ) m (V m X fi (m),V m X f2 (m)) = (d/i(m) ,V m X ^(m)) , (2.11) 

where /i,/2 G C°°(T*Q) are arbitrary smooth extensions of /i,/2- The value of the bracket is in- 
dependent of the extensions by proposition 12.21 We will refer to the above bracket as the standard 
nonholonomic bracket to distinguish it from the affine nonholonomic bracket to be introduced in sec- 
tion m 

Associated to every function fi G C°°(M) we define its (almost) Hamiltonian vector field on A4, 
denoted X^ , by the rule 

X J g{h){rn) = {f 2 ,h} M {m) for all fi G C°°(M). 

Denote by TCm is the restriction of TL to M. The equations of motion for the nonholonomic system 
can be written in terms of the standard nonholonomic bracket as: 

X£(f)(m) = Xtf(f) = {f,n M }M(m) for all / G C°°(M). 
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3 Reduction of Standard Nonholonomic Brackets 



We present a basic scheme of reduction of nonholonomic systems, by reducing the nonholonomic stan- 
dard bracket. Our discussion is global and intrinsic and will be extended for affine brackets ahead. A 
similar discussion is outlined in [25] . In [2l] local expressions are given for the reduced bracket and the 
link with Lagrangian reduction is made. See also [T7] for other specialized cases of intrinsic reduction 
of almost Poisson brackets for nonholonomic systems on Lie groups. 

Definition. Let H be a Lie group that defines an action <I> : H x Q — > Q. We say that H is a 
symmetry of our nonholonomic system if $ lifts to a free and proper action on TQ that leaves the 
constraint distribution T> C TQ and the Lagrangian C : TQ — ► R invariant. 

Suppose that H is a symmetry group of our nonholonomic system and denote by Vl/ : H x T*Q — ► 
T*Q the cotangent lift of $. By the definition of cotangent lift, it follows that leaves the constraint 
submanifold, A4, and the Hamiltonian, TL : T*Q — ► R, invariant. The proof of the following proposition 
is left to the reader: 

Proposition 3.1. The distribution T = {v G T{T*Q) : (r*e l ,v) = 0} C T(T*Q) is invariant under 
the lift ofy to T(T*Q). 

Since A4 is invariant under ^, the action naturally restricts to A4 and TA4 is invariant under the 
tangent lift of It follows from the above proposition that the restricted action to A4 preserves the 
distribution C = TM D T . Since ^ is the lift of a point transformation it is symplectic, so it also 
preserves C n ® . As a consequence, it follows that for all m G M and h G H the following diagram 
commutes: 

T m (T*Q)^T* h{m) (T*Q) 



V< S h (m) 



T m {T*Q)—>Tv h{m) {T*Q) 

J m ^ h 

It is now routine to check that ^ preserves the nonholonomic bracket. That is, for functions /i,/2 G 
C°°(M), and he H,we have 

{/i o / 2 o qt h } M = {f 1 ,f 2 } M o (3.12) 

Let the smooth manifold TZ = M./H denote the reduced space and n : M. — > 7?. denote the orbit 
projection. In view of (|3.12p the following reduced standard nonholonomic bracket for functions Fx, i*2 G 
7?. is well defined: 

{Fl,F 2 }^(7r(m)) :={Fi0 7r,F 2 ovr}^(m). (3.13) 
For a function Fi G C°°(TZ) we define its (almost) Hamiltonian vector field, on 7£ by the rule, 

X£(F 2 ) = {F 2 , FxIk, for all F 2 G C°°(7i). 

Since the Hamiltonian TL (and hence also the constrained Hamiltonian TLm) are invariant, it follows 
that the nonholonomic vector field = Xff M pushes forward by it to the reduced nonholonomic 
vector field X^ satisfying 

X^ (F) = X^ (F) = {F, Hn}n for all F G C°° (TZ) , 
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where the reduced Hamiltonian Hn € C°°(TZ) is uniquely defined by the condition Hm = Tin ?r- 
Summarizing, we have 

Theorem 3.2. Suppose that the Lie group H is a symmetry group for our nonholonomic system. Then 

1. The lifted action ^> on T*Q preserves the standard nonholonomic bracket {•, -}m in the sense of 

2. The smooth reduced manifold 1Z = A4/G is equipped with a reduced standard nonholonomic 
bracket, {•, - }ti, uniquely characterized by equation 13.13\) . 

3. The nonholonomic vector field, X^, is -n -related to the (almost) Hamiltonian vector field X^ 
associated to the reduced Hamiltonian H-ji- 

It is easily shown that the reduced standard nonholonomic bracket, {■,•}■£> is almost Poisson. 
A very interesting question is whether it in fact satisfies the Jacobi identity thus yielding a direct 
Hamiltonization of the problem. It is shown in [T7] that this is indeed the case for the Suslov problem 
and the Chaplygin sleigh. A less stringent condition is the existence of a strictly positive function 
/i : Q/H C 1Z — > M such that the new bracket of functions 

{F 1 ,F 2 }f^:=n{F l ,F 2 } n , 

satisfies the Jacobi identity. In this case we call fi a conformal factor and the reduced equations can 
be written in Hamiltonian form after the time rescaling dt = fidr. Hamiltonization in this way is more 
likely to be accomplished if the reduced space is low-dimensional and should not be expected in general. 
A necessary condition is that the characteristic distribution, U C T1Z, of the reduced standard bracket, 
defined by 

U z = {Xf{z) : F G C°°(K)} C T Z K, 

is integrable. This is a simple consequence of the symplectic stratification theorem for Poisson mani- 
folds. In section [6] we will show that the characteristic distribution of the reduced standard bracket for 
the Chaplygin sphere is non-integrable, and thus, a conformal factor that renders this bracket Hamilto- 
nian cannot exist. As mentioned in the introduction, this example lead us to explore the more general 
concept of affine almost Poisson brackets and their reduction. 



4 Affine Almost Poisson Brackets 



In this section we will construct the affine almost Poisson brackets. We begin by introducing the notion 
of an Affine Almost Symplectic Structure for a nonholonomic system. The idea is that the equations 
of motion for our nonholonomic system (|2.5|) are equivalently written as 




where £Iq is any two-form on T*Q satisfying i y m t*^o = 0. We will require the form Qn = fin + 

to be non-degenerate but we will not ask for it to be closed. After all, the closeness of the canonical 
two- form J7q was never used in the construction of the standard nonholonomic bracket. 

For our motivating example, the Chaplygin sphere, it is through the reduction of an affine almost 
symplectic structure and a time rescaling that the system can be Hamiltonized. 
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The Affine Almost Symplectic Formulation 

We begin by giving our working definition of an affine almost symplectic structure. 

Definition (Affine Almost Symplectic Structure). A nontrivial two- form 0,q on T*Q defines an Affine 
Almost Symplectic Structure, Q,q := VLq + for our nonholonomic system if the following conditions 
hold: 

1. i Xn = 0. 

2. The form £Iq is semi-basic in the sense that it vanishes on vertical vectors. That is, if v is a 
tangent vector to T*Q such that t*v = 0, with r : T*Q — > Q denoting the canonical projection, 
then i^ Qq = 0. 

Condition 1 means that the form f^o "does not see" the free Hamiltonian vector field X-^ . Condition 
2 means that the form fig differs from the canonical form by the addition of a magnetic type term. The 
following proposition shows that all the properties of fig that are relevant for the almost Hamiltonian 
formulation of nonholonomic systems are shared by CIq. 

Theorem 4.1. Let Qq := Qq + Qo be an Affine Almost Symplectic Structure for our nonholonomic 
system. The following statements are true: 

1. The affine almost symplectic structure (Iq is non- degenerate. 

2. The point-wise restriction of Qq to C, denoted tie, is non- degenerate. 

3. The nonholonomic vector field satisfies \^j^l*VLq = i-^M^^Q- 

nh nh 

4- The symplectic complement of J- with respect to Qq, denoted J^ 1 ® , equals the symplectic comple- 
ment of T with respect to VLq, denoted J&Q . That is J&Q = T n Q . 

Proof. For p q G T*Q, denote by V Pq C T Pq (T*Q) the subspace of vertical vectors, i.e. V Pq = {v Pq € 
T Pq {T*Q) : T*v Pq = 0}. It is well known that V Pq is a Lagrangian subspace with respect to Qq, i.e. 
V Uq - V 

V Pq — V Pq- 

Let w Pq £ T p (T*Q) be such that (Iq (w Pq , v Pq ) = for all v Pq € T Pq (T*Q). In particular, since 

vanishes on vertical vectors, it follows that (w Pq , v Pq ) = for all v Pq € V Pq , so w Pq 6 V p Q q Q = V Pq . 
Using again that VLq vanishes on vertical vectors we get Q,Q(w Pq ,v Pq ) = for all v Pq G T Pq (T*Q) which 
by non-degeneracy of Qq implies w Pq = and we have proved 1. 

Now, for m £ M the intersection V m n C m is a Lagrangian subspace of C m with respect to (^c)m- 
This follows from the identity (V n C) n « n C = (V n Q + C n «) DC = V nC. Repeating the argument in 
the above paragraph shows part 2. 

To prove 3 start by defining the vector field X-j-c by the equation dH = \<> £1. We claim that the 
vector fields X^ and are equal. Indeed, writing X-^ = X^ + Y^, and since ix H ^o = 0, we have 

dU = h^Q = { x n ^Q + ir n ^Q = dH + ir n &Q- 
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It follows that iy H ilg = which implies 1^ = by non-degeneracy of shown above. 

From the intrinsic form of the equations of motion (|2.5I) we deduce that along M. we can write 
= X-h + Zn-c = Xn-i + Z-h , where the constraint force vector field Z-^ € JF n< ? . Since V Pq C T Pq 

for all p q € T*Q, then T Pq Q C V Pq Q = V Pq and we conclude that Zn-t is vertical. In view of the above 
observations and using properties (1) and (2) in the definition of the affine almost symplectic structure 
(Iq we find 

^nh ^-nh 

Finally, to prove 4, notice that since f2o vanishes on V Pq , we have V Pq Q = V Pq Q = V Pq . Now, 
by definition of T we have V Pq C T Pq for all p q € T*Q. It follows that T Pq Q C V P ^ Q = V Pq and 

J^ q Q C Vj^ Q = V Pq . Since the forms S7q and Qq agree when contracted with elements in V Pq the result 
follows. □ 

Point 3 in the above theorem shows that starting from (|4.14p and by a reasoning analogous to the 
discussion in section [2J the vector field X^ is uniquely determined by the equation: 

i Y M&c = dH c . (4.15) 

^-nh 

We finish the section with a small digression. It is seen in the proof of point 3 in the above theorem, 
that the reaction force bundle is the vector bundle over T*Q whose fibers are given by J^Q. Point 4 
of the above theorem shows that this bundle can also be written as the symplectic complement of J- 
with respect to fig. This reinforces the idea that fig has indeed all of the relevant properties for the 
description of nonholonomic systems that Qq has. 

The Affine Almost Poisson Formulation 

Let Qq = Qg + be an affine almost symplectic structure for our nonholonomic system. In view of 

theorem 14.11 we have the symplectic decomposition T m (T*Q) = C m © Cm Q ■ Let V : Tm{T*Q) — * C be 
the projector associated to this decomposition. Then, analogous to proposition 12.21 one shows, 

Proposition 4.2. Let f € C°°(M) and let f E C°°(T*Q) be an arbitrary smooth extension of f. Let 
Xj be the vector field on T*Q defined by i^Stq = df. Let X^ denote the vector field on J\A with values 
in C defined by the equation 

' l xj &c = (df)c, 

where Vtc and (df)c denote respectively the point-wise restriction of £Iq and df to C. Then, along M, 
we have Xj = VXj. 

In view of (|4.15j) and as a consequence of the above proposition, the nonholonomic vector field X££ 
satisfies X^ = VX n . 

Analogous to (|2.10p we have: 

X^(f)(m) = (df(m),P m X n (m)) = (n Q ) m (X f (m),P m X n (m)) 

= (n Q ) m (V m X f (m),V m X H (m)), (4.16) 
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where, for the last identity to hold, we have used the fact that V is associated to a symplectic decomposi- 
tion with respect to Q,q. We now define the affine nonholonomic bracket for functions fx, f 2 G C°°(A4): 

{/i,/ 2 } M H := (n Q ) m (P m X fi (m),V m X f - 2 (m)) = {dfx(m),V m X h {m)), 

for arbitrary extensions fx, fi G C°°(T*Q) of fx, f 2 - The affine nonholonomic bracket is well defined in 
view of proposition 14.21 

Associated to every function fx G C°°(M) we define its (almost) affine Hamiltonian vector field on 
M, denoted Xf^, by the rule 

X^(f 2 )(m) = {f 2 ,f 1 } M (m) for all f 2 G C°°(M). 

Notice that in general {fx,f2}M {/ij/2}^ and consequently, for a general / G C°°(M), the 
vector fields Xf 4 and Xf 4 are different. However, in view of (I4.16P we see that 

XX(f)(m) = Xtf M (f)(m) = Xtf M (f)(m) = {f,H M } M (m), for all / G C°"(M), 
so we can also write the equations of motion for the nonholonomic system in terms of the affine bracket. 

5 Reduction of Affine Nonholonomic Brackets 

The reduction scheme presented in section [3] and summarized in theorem 13.21 holds for affine brackets 
but we need to ask that the lifted action to T*Q leaves the affine term f^o invariant. 

Theorem 5.1. Suppose that H is a symmetry group for our nonholonomic system. Suppose, in addi- 
tion, that there is an affine almost symplectic form Q,q = Qq + f^o f or our nonholonomic system and 
that the form f^o is invariant under the cotangent lifted action ^> : H x T*Q —* T*Q. Then 

1. \P preserves the affine bracket {-,-}m * n ^ e sense that for all fx^fi G C°°{M.) and h G H we 
have 

{/i o f 2 o * h ) M = ih, h} M o */,. (5.17) 

2. The smooth reduced manifold 1Z = M/G is equipped with a reduced affine nonholonomic bracket, 
{•, -} K , uniquely determined by the relation 

{F 1 ,F 2 } n (7r(m)) = {F 1 o 7 r,F 2 o 7T } M (m), for Fx,F 2 G C°° (1Z) , m e M, 

and where tt : A4 — > 7Z denotes the orbit projection. 

3. The nonholonomic vector field, X^, is n -related to the (almost) Hamiltonian vector field X^ 
associated to the reduced Hamiltonian Tin, that is uniquely determined by the condition T~Lm = 
Tin o tt. 
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In point 3 of the above theorem, and in what follows, we have denoted by Xp the (almost) Hamil- 
tonian vector field corresponding to F € C°°{1Z) defined in terms of the reduced affine nonholonomic 
bracket {•, -}^. 

Just as in the reduction of standard nonholonomic brackets discussed in section [3l it is straight- 
forward to check that the reduced affine nonholonomic bracket, l/,-}^, is an almost Poisson bracket. 
The question remains of whether it in fact satisfies the Jacobi identity, maybe after multiplication by 
a conformal factor. We will show that this is indeed the case for the Chaplygin sphere. 



6 Example: The Chaplygin Sphere 

We will illustrate the need for affine almost Poisson brackets by working out the almost Poisson reduc- 
tion of the celebrated Chaplygin sphere problem with the standard nonholonomic bracket and with an 
affine bracket. We will show how Hamiltonization after reduction can only be achieved by starting out 
with an affine bracket. 

The Chaplygin sphere problem concerns the motion of an inhomogeneous sphere whose center of 
mass coincides with its geometric center that rolls without slipping on the plane. The configuration 
of the system is Q = 50(3) x R 2 . An element q £ Q will be denoted by q = (g; (x,y)). Here (x,y) 
give the cartesian coordinates of the center of the sphere on the plane, and g £ 50(3) specifies the 
orientation of the sphere by relating, at any given time, a fixed space frame to a moving body frame 
that will be assumed to be aligned with the principal axes of inertia of the sphere. These two frames 
define respectively the so-called space and body coordinates. 

The key aspect that renders the study of the Chaplygin sphere interesting is that the kinetic energy 
writes naturally in terms of body coordinates while the constraints are naturally written in space 
coordinates. This is related to the fact that the problem is an LR system in the sense of [31 J when 
considered on the direct product Lie group Q = 50(3) x R 2 . As a consequence, we will be forced 
to work with both the right and the left trivializations for 50(3). Throughout the section we will 
constantly write formulas with respect to both coordinate systems. 



A Moving Frame Approach 

To avoid working with Euler angles or other local coordinates for 50(3) we will use moving frames to 
describe the system globally. 

Identify the Lie algebra q = so (3) with R 3 using the hat-map, 

I -v 3 v 2 
v = (vi,v 2 ,v s ) l-> V = V 3 -Vl 

\ -V 2 Vi 

The above is a Lie algebra isomorphism with the commutator in R 3 being the usual vector product. 

Let {ei,e2,es} be the canonical basis for the Lie algebra g = R 3 . The rolling takes place on a plane 
parallel to {ei,e 2 } and normal to e 3 . The coordinates of the contact point on the table with respect 
to the basis {ei,e2} are (x,y) £ R 2 . 
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The moving frame {X[ lght (g), X" ght (<7), X!j lght (<7)} that forms a basis for T g SO(3) is obtained by 
right translation of {ei,e2,e3}. The dual co-frame will be denoted by {p±(g), P2(d), Pz(g)}- Similarly, 
left translation of {ei,e 2 ,e 3 } by g e 50(3) defines the moving frame {X x f t {g),X l f t {g),X l f t (g)} that 
forms a basis for T g S0(3) whose dual co-frame will be denoted by {^i(g) , ^2(9) , ^3(9)} ■ 

For a tangent vector v g £ T 9 50(3), the corresponding angular velocity in space coordinates is the 
element w s 6 g = I 3 , obtained by right trivialization. Its components are defined by 

v g = ulX^ h \g) + u s 2 X^ h \g) + ^ ig %), 

or, equivalently, by u>f = (pi(g),v g ). Analogously, the angular velocity in body coordinates is the element 
io b € q = M 3 , obtained by left trivialization and whose components are given by u\ = (Xi(g),v g ). 

These two vectors are related by the Adjoint map: Ad g := T e (L g o R g -i) : Q — > g. We have 
uj b = Ad s -iuA Define the coefficients gij : 50(3) — > M by Ad fl -iej = gij&j- For v G K 3 = g we have 
Ad 9 v = gv. It follows that J 3 = g^ 1 ^ and that g^ equals the i, j component (i th row, j th column) of 
the matrix g £ 50(3). Since g^ 1 = g T we have gug k j = an d we can write 

u) = gijuf, uf = g^u), 

or equivalently, Xj(g) = gijPi{g), Pi{g) = gijXj{g). 

Denote by cm the structure constants of the Lie algebra defined by [ej,efc] = Qfe/e/. We have 
Ciki = if two of the indices are equal, C{ k \ = 1 if (i,k,l) is a cyclic permutation of (1,2,3) and 
Ciki = — 1 otherwise. The following proposition will be used in what follows. 

Proposition 6.1. We have dgij = c ik igijp k = ci kj gu\ k . 

Proof. Let {/i,/2,/3} be the dual basis to {e\,e2,e^}. We can write g^ = (fj, Ad g -iej). Therefore, 

d 9ij(X k ght ) = ^\ t=0 (fj,Ad {cxp(ekt)g) -iei) = (e J ,Ad g -i[ei,e k ]) 
= c ik i{fj,Ad g -iei) = c ik igij. 

For an arbitrary v g G T s 50(3), writing v g = (p k , v g )X r k 1&ht , and using the above equation shows the 
result. The proof of the other identity is analogous. □ 

The Chaplygin Sphere Problem 

At a given point q = (g; (x,y)) <G 50(3) x M 2 we can use either {X" ght (g), d x , d y } or {X\ eft (g) , d x , d y } 
as basis for the tangent space T q Q. A tangent vector v q £ T q Q can be written as 

v q = ujfxf^ig) + v x d x + v y d y = u\xf h {g) + v x d x + v y 8 y . 

The Lagrangian C : TQ — > R is of pure kinetic energy and in body coordinates is given by 

C(v q )= l -{lJ).J+ l -m(vl + v^). (6.18) 
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Here I is the inertia tensor which, under our assumption that the body frame is aligned with the 
principal axes of inertia of the body, is represented as a diagonal 3x3 matrix whose positive entries, 
Ii, are the principal moments of inertia, and " • " denotes the canonical scalar product on M 3 . 

The rolling constraints are: 

v x = ru;|, v y = -rwj, (6.19) 

where r is the radius of the sphere. We have a three dimensional constraint distribution V C TQ 
defined as the annihilator of 

e x = dx-rp 2 , e y = dy + rp 1 . (6.20) 

A basis for T> q is {Xl lght (g) — rdy,X T 2 lg]lt {g) + rd x , X^ lght (g)}. The first two vector fields in this basis 
define rolling motions in the x and y directions with the accompanying rolling motion. The third one 
defines twisting motion, where the sphere spins about the vertical axis but stays put in the table. 

We now pass to the Hamiltonian formulation. The Lagrangian £,, being hyper-regular, defines a 
Riemannian metric on Q. For q € Q, we have a natural isomorphism Leg 9 : T q Q — > T*Q, the Legendre 
transform of C The Lagrangian then writes as £(v q ) = ^(Leg q (v q ),v q ) for v q G T q Q, where (•, •) is the 
duality pairing. 

To compute explicitly the Legendre transform, notice that at the point q = (g; (x, y)) € SO(3) x M 2 
we can use either {p\(g), p2(g), ps(g),dx,dy} or {X\(g), A2G5O, ^(g), dx, dy} as basis for the cotangent 
space T*Q. A cotangent vector a q € T*Q, can then be written as 

a q = M*pi(g) + p x dx + p y dy = M b \i(g) + p x dx + p y dy. 

The vectors M s and M b with entries Mf and M\ are, respectively, the angular momentum of the ball 
with respect to its center expressed in space and body coordinates. They define elements in the dual 
Lie algebra of SO(3) and are related by M b = Ad*_iM s . With the identification of R 3 and (M 3 )* 
via the euclidean pairing, we can write M & = g~ 1 ~M. s . The vector (p x ,p y ) is the linear momentum of 
the center of mass of the ball. We have thus defined (NL s ;p x ,p y ) and (M. b ;p x ,p y ) as possible sets of 
coordinates for T*Q. We will refer to them as space and body coordinates respectively. 

The Legendre transformation maps the vector v q £ T g Q to the covector a q € T*Q according to the 
rule: 

M b = loj h , M s = glg^ 1 ^, p x = mv x , p y = mv y . 

The rolling constraints write in the Hamiltonian formulation as, p x = mru^, p y = — mrwf, with 
uj'1,lo2 written in terms of M 6 or M s by means of the inverse Legendre transform. The constraint 
manifold M is defined as the set of a q € T*Q such that these equations hold. 

The Hamiltonian H : T*Q — > R is given in body coordinates by 

H{a q ) = \m b .(i-^m b ) + ^-(pl+pl). 

The canonical one-form on T*Q writes a^l Qq = M b \i + p x dx + p y dy in the left trivialization, 
and as @q = Mfpi + p x dx + p y dy in the right one. Using Cartan's structure equations one has 

2 To simplify notation, and hoping that there is no danger of confusion, we also denote by Ai, pt the pull-backs 
t* pi by the canonical projection r : T*Q — > Q. 
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d\\ = — A2 A A3, . . . , dpi = pi Ap 3 , . . . (cyclic), and we find the following expressions for the canonical 
two- form Qq, 

VLq = -d@Q = A fc A dM b k + M\\ 2 A A 3 + M|A3 A Ai + M\\\ A\ 2 + dxA dp x + dy A dp y 

= p k A dMl - M{p 2 A p 3 - M^p 3 Api- M|pi A p 2 + dx A dp x + dy A dp y , (6.21) 

The free Hamiltonian vector field X-h, defined by \x h Qq = dTi, is then shown to be given bjH 

X n = c kij My k d M p + u\xf h + ^d x + ^d y , (6.22) 
j j i mm 

in body coordinates. The above vector field defines the usual free rigid body equations for a body whose 
center of mass lies on the (x, y) plane. This is, of course, trivial; in the absence of rolling constraint 
forces, the sphere is just a free rigid body. This type of motion will generally not satisfy the constraints. 

A Change of Coordinates on the Fibers 

We introduce a change of coordinates on the fibers of T*Q inspired by [30]. See [22] for an interesting 
discussion of this kind of change of coordinates in the context of moving frames. In our particular case, 
the change of coordinates has a precise physical meaning; it corresponds to working with the angular 
momentum with respect to the contact point, instead of the angular momentum with respect to the 
center of mass. 

Working in space coordinates, and since a basis for V q is {X" ght — rd y ,X2 ght + rd x , X" ght }, fol- 
lowing [30] we define at T*Q the new (space) fiber coordinates, (K s ;m x ,m y ), in terms of our existing 
coordinates, (M. s ;p x , p y ), by 

K{ = Ml - rp y , Kl = Mi + rp x , K| = M|, m x = p x , m y =p y . (6.23) 

In body coordinates, we will consider the new fiber coordinates (K b ;m x ,m y ) with K 6 := g -1 K s . 

Along the constraint manifold A4 we have p x = mruj^ and p y = —mruf. Substituting these 
expressions into (|6.23|) . a short exercise in classical mechanics shows that the vector K s (resp. K 6 ) 
is indeed the angular momentum of the ball with respect to the contact point written in space (resp. 
body) coordinates. Moreover, by writing M s (resp. M b ) in terms of uj s (resp. uj b ), one can easily 
derive the expressions, 

K s = ^IcrV + mr 2 (co s - (lo s ■ e 3 )e 3 ), (6.24) 
K b = luo b + mr 2 {J -(J -7)7), (6.25) 

where again " • " denotes the usual dot product in M 3 and the Poisson vector, 7 S M 3 , is defined by 

7 := 5 _1 e 3 or, in components, 7, := g 3i . 

Physically, 7 is the vertical unit vector expressed in body coordinates. For a Lie algebraic interpretation 
of 7 see [28J . 

3 As with the previous remark, we ask the reader to interpret Xl cCt ,d x ,d y in the right way. They are tangent vectors 
to T*Q with no component in the d M b , 9 Px , d Py directions, that push down to the old X\ e{t , d x , d y by r : T*Q — > Q. 
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Expressions (|6.24p and (I6,25p can be inverted to write ui s (resp. ui b ) in terms of K s (resp. K b ). 
To obtain these expressions explicitly let E denote the 3x3 identity matrix and let A := (I + mr 2 E). 
Taking the dot product on both sides of (|6.25p with A~ 1 j yields, 

o;| = cj b • 7 = ' , with Y(j) := 1 - mr 2 (j ■ A^j). (6.26) 



It follows that 



and 



J = A~ l K b + mr 2 (^-A^A A~ 1 ^, (6.27) 

Y [V J 



<S = gA-^K* + mr 2 ( (g ~ \ A ~ A gA-\ 



The dimensionless quantity Y("f) will turn out to be very important in the context of Hamiltoniza- 
tion. It is seen to be strictly positive since ||7|| = 1 and all of the principal moments of inertia Ii > 0. 

Along the constraint submanifold Ai, the above equations allow us to write, via the constraint 
equations m x = p x = mrw|, m y = p y = — mrwf, the quantities m x and m y as functions of (g, K s ) or 
(g,~K b ). We can therefore use ((<?; (x,y)); ,K S ) or ((g; (x,y)),~K b ) as induced coordinates for M.. The 
constrained Hamiltonian TCm := H.\m can De written in these coordinates as 

H M = \k s ■ lo s = ±K b ■ co b . (6.28) 

The SE(2) Symmetry 

Since the time evolution of the system is independent of the position of the origin and the orientation 
of the axes on the plane where the rolling takes place, it is natural to expect the left multiplication by 
elements in H = SE{2) to be a symmetry group for the problem. 

We represent SE{2) as the subgroup of GL^(W) consisting of matrices of the form 



/ a \ 

h b 


V o o o i / 



h 

where h € SO (3) is of the form h = | 

1 



with h € SO(2) and a, b G R. The generic element in se(2) will be denoted by (h\ a, b). The action on 
an element q = (g; x, y) £ Q is defined as: 

(h;a,b) : {g;x,y) — > (hg; (x, y)h l + (a, b)) G Q. 

Proposition 6.2. With the above definition of its action on Q, SE(2) is a symmetry group for the 
Chaplygin sphere system. 
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Proof. We need to show that the lifted action to TQ leaves both the constraints and the constraint 
distribution invariant. Working with body coordinates, the lifted action to TQ maps the tangent vector 
v q = (u! b ;v x ,Vy) G T q Q according to the rule 

(h;a,b) : (uj b ; v x , v y ) — ► {u h ;{v x ,v y )h t ) G T {h - a ,by q Q, 

and it is immediate to check that the Lagrangian (|6.18p is invariant. Working in space coordinates, 
the lifted action to TQ maps the tangent vector v q = v x , v y ) G T q Q according to the rule 

(h; a, b) : (uj s ; v x , v y ) — ► (Ad h uJ s ; (v x ,v y )h t ) = (huj s ; (v x ,v y )h t ) G T {h . a>b y q Q , (6.29) 

and it is readily shown that the rolling constraints (|6.19p are invariant. □ 

According to theorem l3.2l the reduced space 1Z := M./ SE{2) is equipped with the reduced standard 
nonholonomic bracket {-,-}tz, and the reduced equations can be written with respect to this bracket. 
The properties of this bracket will be studied further ahead. 

We claim that associated with this symmetry there is a conservation law: the vertical component of 
angular momentum about the center of mass of the sphere, M|, (which agrees with vertical component 
of the sphere's angular momentum about the contact point, is constant throughout the motion. 
Contrary to usual holonomic mechanics, the relationship between symmetries and conservation laws is 
not straightforward for nonholonomic systems, (see [3] for a thorough discussion). To show our claim 
we could follow [3] and compute the momentum equation but we choose to invoke instead the following 
nonholonomic version of Noether's theorem whose proof can be found in PQ: 

Theorem 6.3. Let H be a Lie group that acts on Q with Lie algebra \] and dual Lie algebra f)*. For £ G 
I), denote by £q(<?) G T q Q the infinitesimal generator of the action on Q, and by £,T*Q(ctq) £ T aq (T*Q) 
the infinitesimal generator of the lifted action to T*Q. If the Lagrangian C is invariant under the 
lifted action to TQ and if £,q(q) G T> q for all q G Q, then the components of the momentum map, 
J H ■ T*Q -> fj* ; defined by 

d(J H (a q ),0 = ie T « Q (a ? )^Q, 

are constant during the motion. 

Notice that, contrary to the definition of a symmetry group for a nonholonomic system given in 
section in the above theorem we do not require the constraint distribution to be invariant under the 
action. Notice as well that the momentum map, Jh ■ T*Q — > [)*, always exists since we are working 
with a lifted action, see [26J. 

To apply the theorem we consider the twisting action of 50(2) on Q. Represent 50(2) as the 
subgroup of 50(3) consisting of matrices of the form, 

h = I ~ k V with h G 50(2). 
V 1 J 

The twisting 50(2) action on Q is defined by: 

h : (g;(x,y)) — ► (hg;(x,y)), for (g; (x,y)) G Q. 

The proof of the following proposition is left to the reader. 
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Proposition 6.4. The twisting SO(2) action on Q satisfies the hypothesis of theorem \6.3\ and the 

associated conserved quantity is M| = = M b • 7 = K fc • 7. 

After this digression on conserved quantities we come back to the discussion of the SE{2) symmetry 
with the aim of performing the corresponding reduction as described in section [3j In view of (|6.29p . 
the cotangent lift of the action to T*Q expressed in space coordinates maps the cotangent vector 
(M. s ;p x ,p y ) G T*Q according to the rule 

(h;a,b) : (M s ;p x , Py ) — ► (AdJ^M"; (p^Pyff) = (hM s ; (jp^Pyfh 1 ) G T* {h . ab) . q Q. (6.30) 

In view of (|6.3U|) and the transformation rules (|6.23|) . the cotangent lift of SE{2) to T*Q is expressed 
in the new space fiber coordinates as the map acting on (K s ; m x , m y ) G T*Q by 

(h; a, b) : (K s ; m x , m y ) — ► (Ad^K 8 ; (m x , m y )/?) = (hK s ; (m x ,m v )h t ) G T* {h . a b) . q Q. 

Consequently, the action in the new body coordinates (K. b ;m x ,m y ) G T*Q is given by 

(h;a,b) : (K b ;m x ,m y ) — > (K b ; (m^mjjii') G T* h . ab y q Q. 

Since the action preserves the constraint manifold A4, the restricted action on A4 is represented in 
the induced body coordinates ((g; (x,y));~K b ) for M. as 

(h;a,b) : ((g; (x,y)); K b ) — > ((hg; (x, y)/i* + (a, b)); K b ). 

Notice that the action leaves K b and the components of the Poisson vector 7 (the third row of g) 
invariant. The latter are not independent as H7H = 1, but we can use (K b ,7) as redundant coordinates 
for the reduced space 1Z := M/SE(2) = M 3 x S 2 . In what follows we will represent 1Z as the embedded 
submanifold in M 6 = {(K 6 ,7) : K b G R 3 , 7 G M 3 } defined by the condition ||7|| = 1. 

The reduced Hamiltonian Ti-ji : 7Z — > R is given by 

Hn(K b , 1 ) = ±K b -u; b , (6.31) 

with u b given by (|6.27j) . Finally notice that the constant of motion, M| = = K b • 7, is invariant 
under the action since it can be written in terms of the coordinates (K fe ,7). 

An Affine Almost Symplectic Structure for the Chaplygin Sphere 

Let v denote the dimensionless, bi- invariant volume form on £0(3), oriented and scaled such that for 
the canonical vectors ei,e2,e3 G M 3 we have u(ei, e2, e^) = 1. Since Q = SO {3) x R 2 , v naturally 
defines a three-form on Q that, via the cotangent bundle projection, r : T*Q — ► Q, pulls back to a 
basic three- form v on T*Q. 

Denote by the free Hamiltonian vector field of the sphere (I6.22|) and define the two-form Qq on 
T-Q by 

Oo := —mr 2 ix n v. (6.32) 
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Proposition 6.5. The two-form £Iq := £Iq + Qq on T*Q defines an affine almost symplectic structure 
for the Chaplygin sphere problem. 

Proof. It is clear that 0,q is a semi-basic two-form since it was constructed by pulling back a form on Q 
and then contracting with X^. It is also clear that ix H ^o = so the two conditions in the definition 
of an affine almost symplectic structure are satisfied. □ 

Therefore, according to the theory developed in section [H there is an affine nonholonomic bracket, 
{-,-}m> associated with the affine almost symplectic structure Uq. The following proposition shows 
that the hypothesis in theorem 15.11 are satisfied so we can reduce the system in terms of an affine 
reduced nonholonomic bracket {•, -} n on 1Z = M./ SE{2). 

Proposition 6.6. The form Q,q defined above is invariant under the cotangent lift of the SE{2) action 
to T*Q. 

Proof. For £ G se(2), denote by £t*q the infinitesimal generator of the action on T*Q. We have 

£^ T , Q n = -mr 2 (i[z T * Q ,x n \v + ix n £z T * Q v) ■ 

Since the Hamiltonian TL : T*Q — > M is invariant under the action we have [£,t*q, = 0. We also have 
£^ T , Q D = by invariance of v. Thus, £^ TtQ Qo = 0, and the result follows since Q is connected. □ 

Since the three-form v is given by v = \\ A A2 A A3 in body coordinates. In view of (16. 22ft we find 
the following explicit formula for Qo-> 

n = -mr 2 {uj\\2 A A 3 + u^A 3 A Ai + A A 2 ). (6.33) 
The Reduced Brackets 

We now study the reduced brackets {•, - }tz, and {•, -}^, on the reduced 5-dimensional space 1Z = M 3 x S* 2 . 
We will show that the characteristic distribution corresponding to the reduced standard nonholonomic 
bracket, {-,-}tz is non-integrable, being thus very different from that of a true Poisson bracket. In 
contrast, we will show that the reduced affine nonholonomic bracket, {•, -}^, satisfies the Jacobi identity 
after multiplication by a conformal factor. This allows us to effectively write the reduced equations of 
motion in Hamiltonian form after a rescaling of time. 

Recall that the reduced space 1Z is represented as the embedded submanifold in M 6 = {(K fc ,7) : 
K 6 £ M 3 , 7 € M 3 } defined by the condition ||7|| = 1. We will give explicit formulae for the brackets in 
these coordinates but we first need to derive some formulas. 

We begin by writing the canonical symplectic form Qq in terms of our new fiber coordinates. In 
view of (|OTT) and (|Q3l we find, 

VLq = pi A dKf - K(p 2 Ap 3 - K2P3 A pi- K^pi A p 2 - rm y p 2 A p 3 + rm x p 3 A p\ 
+ ( T *£x) A dm x + (j*e y ) A dm y , 
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where e x , e y are the constraint one- forms on Q defined in (I6.20p . The pull-back of Qq to Ai via the 
inclusion map i : Ai T*Q can be expressed in the induced coordinates ((g; (x,y),K s ) as, 

l*Qq = piAdK* -Klp 2 Ap3-Klp 3 /\p 1 -Klpi/\p 2 +mr 2 (uJ s 1 p2/\p3+uj s 2 p 3 Ap 1 ) 
+mr((r*e x ) A duj s 2 - (r%) A dui{). 

Therefore, the restriction, of l*Qq to the space C = TAi PI ann{r*e x , r*e y } C T(T*Q) is given by 

C = ^ A dKf - K[p 2 A P3 — *Tfp 3 A pi — K|pi A p 2 + mr 2 (wjp 2 A p 3 + wfp 3 Api). (6.34) 

Proposition 6.7. T/ie restricted forms f^c afirf are expressed in body coordinates as 

n c = Xi A diff + KiA 2 A A 3 + K 2 X 3 A Ai + K|Ai A A 2 + mr^WgAi A A 2 + u>\X 2 A A 3 + uj 2 X 3 A A x ) 
-mr 2 ^( 73 Ai A A 2 + 72 A 3 A Ai + 71 A 2 A A 3 ), (6.35) 

Q c = Ai A dK% + K\X 2 A A 3 + K% A Ai + iffAi A A 2 

-mr 2 u;|(7 3 Ai A A 2 + 7 2 A 3 A Ai + 71 A 2 A A 3 ). (6.36) 

Proof. Adding and subtracting mr 2 {ui^p\f\p 2 ) to (|6.34p and using the identities Kf = gijKj, lo? = gijLUj, 
and pi = guXi, together with proposition 16.11 we get, 

= OiWijXi A dKj + CrkjgugirKjXi A A fc 

+ (gijg2ig3r + S2i33;Sir + g3jgug2r)(-Kj + mr 2 u^)A ; A A r - mr 2 ul(gi j g 2 i)X j A A;. 

Since 5 G SO(3) we have, ft^y = det(g) = 1, and g^gu - g%j9xi = Cjikg3k = Cjiklk- Using these 
identities in the above expression for Qc gives (|6.35p . The proof of (|6.36p is immediate in view of 
(I6~33D . □ 

Use ((<?; x, y); K b ) as coordinates for Ai. Denote by d K b G TAi the tangent vector obtained as a 

derivation with respect to Kf when considered as a coordinate on Ai. Similarly, denote by X\ ,d x , d y 

the tangent vectors to Ai, with zero component in the directions of d K b that push forward to the 

3 

tangent vectors Xj , d x ,d y G TQ by the composition t o l : Ai ^ T*Q — > Q. The latter always exist 
since Ai is a vector bundle over Q. We claim that 

B := {Af ft + r(g 2i d x - g u d y ), d Rb : i = 1, 2, 3} (6.37) 

i 

is a basis for the subspace C = TAi Pi annjT*^, T*e ?/ }. The vectors are tangent to Ai by definition. 
Moreover, they are annihilated by r*e x and T*e y since in body coordinates we have 

e x = dx - rg 2j Xj, e y = dy + rg^Xj. 

It is also immediate to check that they are linearly independent and span C. 
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The Reduced Standard Nonholonomic Bracket 

The following proposition gives explicit formulae for the reduced standard bracket. 
Proposition 6.8. We have 

{Kl K)} n = -a fl (K* + mr 2 (^ - ^ 7 /)) , IjH = -cyrTl, {lUljH = 0. 

Remark Notice that the quantities and u> b , appearing in the above formulae, can be expressed 
in terms of our coordinates (K 6 ,7) via (|6.26p and (|6.27|) . Also notice, by a short calculation, that 
according to the above formulae, ||7|| 2 is a Casimir function, so they indeed define a bracket on TZ. 

Proof. Use ((<?; x, y)\ K fe ) as coordinates for A4. With the definition given for equation (|2.8p . we claim 
that 

X c Kl = Af ft + r{g 2i d x - 9li d y ) + Cijl (K? + mr 2 ^ - u s 3ll ))d K ,, X% = c ijni B K t. 

To show the claim, first note that the above vector fields indeed lie on C as they are expressed in terms of 
the basis B defined in (|6.37p . Next, putting 7^ = g^i and using proposition 16.11 we find dr/i = —Ciji'fiXj, 
and in view of (I6.35P one verifies by a direct calculation that the above vector fields satisfy: 

i x c n c = (dKf)c, i x c &c = (dji)c- 

K b 7; 
% 

In addition, by proposition 12.2^ the above vector fields equal VX K b and VX^ respectively. Therefore, 
by definition of the standard nonholonomic bracket {■, - }m given in (I2.11|) we have, 

{KlK}} M = -{dK$,PX K >), {Kl l3 } M = -(d 7j ,VX Kt ), H,jj}M = -(dj^VX^). 

The result now follows by computing the above pairings explicitly, and then using invariance of K fe and 
7 and the definition of the reduced standard nonholonomic bracket {-,-}tz- D 

In view of the above proposition we find the following expressions for the (almost) Hamiltonian 
vector fields associated to the coordinate functions on TZ with respect to the standard reduced bracket 

{vhe: 

Xf <b = Cijl {K\ + mr 2 {J{ - u s 3ll )) d* b + cijmd* X? = -Ci jni d% b , (6.38) 
where d^ b and denote derivations on TZ with respect to K b - and jj. We are now ready to show 

Theorem 6.9. The characteristic distribution of the reduced standard bracket, U := {X'p : F G 
C°°(TZ)} C TTZ, is non-integrable. 

Proof. At every point in TZ, any such vector field Xp is a linear combination of the six vector fields 
defined in (|6.38p . At a generic point in TZ only four of them are linearly independent. To see this, first 
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recall that they are all annihilated by |d||7|| 2 since they are vector fields on 7Z. In addition, a direct 
calculation shows that they are also annihilated by the one-form: 



a : = dKI + mr 2 u; b ■ dj = d(K b ■ 7) + mr 2 u b ■ dj 
(Kt + mr^dji + ndKf. 



Thus, a annihilates any vector in U. The crucial point is that a is not closed. To formally show that 
U is non-integrable we will prove that 

da(x% b ,X^ > 0. (6.39) 

Suppose for the moment that the above inequality holds. In view of the well known identity da(X, Y) = 
X(a(Y)) — Y(a(X)) — a([X,Y]), and since a annihilates both XV: b and Xj}, inequality (I6.39|) implies 

i 

that there exists iq £ {1,2,3} such that 



a 



^ 0, (no sum over iq) 



and non-integrability follows from Frobenius' theorem. 

To show that (|6.39p holds, notice that in view of (|6.27[) we can write lo\ = T^ijKj where T{^)ij 
are the components of the 7 dependent, 3x3 matrix 

It is clear from the above expression that T(rf) is symmetric and positive definite. We can then write 



da = mr z duj° A 7; = mr z T(j) ij dK 1 ' A dji + K- - yiJ J dj k A dj, 

V c*7fc 

Therefore, a direct calculation using (|6.38|) gives 

da (x™ b ,X?f\ = T^ijcukcijrjklr- 
Using the identity Q ifc Qjr = Sijbkr ~ 5 iT 5 jk , and ||7|| 2 = 1 we get: 

da (Xfyxty = trace (T( 7 )) - T( 7 ) ij7aj . 
The above quantity is strictly positive since T( 7 ) is positive definite and || 7 || = 1. □ 

As it was mentioned at the end of section O the non-integrability of the characteristic distribution 
implies that there cannot exist a conformal factor for the reduced standard bracket that Hamiltonizes 
the problem. As we shall see, the situation is quite different for the reduced affine bracket. 
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The Reduced Affine Bracket 

The following proposition gives explicit formulae for the reduced affine nonholonomic bracket {•, in 
the coordinates (K b ,"f) for the reduced space 1Z. 

Proposition 6.10. We have 

{Kl Kf} n = -dp (k\ - mr 2 u s 3ll ) , {K\, = - w , H,7j} n = 0- 
The remark made after the statement of proposition 16,81 also applies here. 
Proof. The proof is analogous to that of proposition 16.81 The crucial point is to derive the identities, 

X*t = Xl eit + r(g 2i d x - g u d y ) + Cij i(K? - mr 2 ^ 7i ))^, X%. = c ija id K b. 

using expression (|6.36p for tic- □ 

Using this expressions for the bracket we can now show: 

Theorem 6.11. The angular momentum with respect to the vertical axis, = K fe • 7 is a Casimir 
function of the affine reduced bracket {•, i.e. {F, K. b ■ 7}^ = for all F G C°°(7Z). 

Proof. A simple calculation shows {Kj, K fe • 7}^ = {jj, K. b ■ 7}^ = 0. □ 

Therefore, in contrast with theorem 16.91 we have, 

Corollary 6.12. The characteristic distribution of the reduced affine bracket, U = {X^r : F € 
C°°(1Z)}, is everywhere tangent to the foliation oflZ defined by the level sets of = K 6 • 7. 

So the properties of the two reduced brackets are fundamentally different. 
Hamiltonization of the Reduced Affine Bracket 

An even stronger result than the one given in corollary 16.121 is that the strictly positive function 
ji : Q/G = S 2 — > M. given by ^(7) = Y^) 1 / 2 , with Y(p/) = 1 — mr 2 (j ■ A^j), is a conformal factor for 
the affine reduced bracket. Define the new bracket {•, -} K of functions on 1Z by the rule: 

{F 1 ,F 2 }^:=^{F 1 ,F 2 } n . (6.40) 

This is exactly the bracket for the Chaplygin sphere problem given by the authors in [H \7\ . 

Theorem 6.13. The bracket {•, of functions on 1Z defined by satisfies the Jacobi identity. 

The proof is a long calculation that will not be included due to space constraints. We can provide 
the details upon request. Define a new time r by the rescaling: 

dt = fi dr. 
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The reduced equations of motion can be written in Hamiltonian form in the new time r as: 

^ = {F,n n }n, for all FeC°°(n). (6.41) 

Moreover, the function M| = K fe • 7 is also a Casimir function for the scaled bracket {•, -}^. So the 
above equation defines a two degree of freedom Hamiltonian system in each level set of M|. 

The Reduced Equations of Motion and their Integrability 

We now write explicitly the equations of motion and discuss their integrability in the context of the 
Hamiltonization discussed above. By differentiating the reduced Hamiltonian (|6.3ip one finds after a 
lengthy but straightforward calculation, 

dHn b dTi-R, 2 s( b s \ 

Using these expressions and any of the reduced nonholonomic brackets (either the standard or the 
affine) one computes the reduced equations of motion, 

K b = -CijiKfJj, 7i = Cjiaiu), where = 

In vector form we obtain the equations that are usually found in the literature: 

K b = K b x J, 7 = 7 x uo b . 

These equations have the geometric integral ||7|| 2 = 1, the conserved quantity arising from the SE(2) 
symmetry, K 6 • 7, and the energy integral, %n- In addition, the function J := ^K 6 • ~K b = ^5ijK b K b 
is directly seen to be in involution with Tin (with respect to any of the brackets in 1Z). In addition to 
these integrals, one can show that the measure [i('y)~ 1 d'K b d'y is preserved by the flow. It follows that 
the system is integrable by quadratures by Jacobi's theorem on the last multiplier. 

The reduced equations were first solved by Chaplygin, [S], in terms of hyper-elliptic functions. A 
summary of the integrability can be found in [HE] were it is shown that the solutions define rectilinear 
nonuniform motion in two dimensional tori. The algebraic integrability of the system is considered in 
|llj and a complete complex solution can be found in |16j . 

In (|6,4ip the equations of motion were written in Hamiltonian form in the new time r. In each 
symplectic leaf, defined as a level set of K fe • 7, we have a two-degree of freedom Hamiltonian system. 
Since Tin and J are in involution, and their level sets are compact, we have an integrable Hamiltonian 
system in the Liouville sense. 

From Liouville's theorem we recover the results given in [TJ [H] of uniform rectilinear motion in the 
new time r on two-dimensional tori. Notice that the existence of a preserved measure follows directly 
from the Hamiltonization of the problem, it is a multiple of the Liouville measure for the rescaled 
Hamiltonian flow on each symplectic leaf. In fact, any (almost) Hamiltonian vector field with respect 
to the reduced affine nonholonomic bracket {•, will preserve the same measure. In particular, this 
is the case for the vector field Aj that by a direct calculation can be shown to be given by 

Xf = -CijtK^Au^t - c ij a i (Au b ) l . 
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Moreover, since Ti-ji and J are in involution, the vector fields Xj~ and commute after scaling 

them by ^(7). The observation that these two vector fields commute already appears in [11] where no 
reference to the Hamiltonization of the problem is made. 

7 Final Remarks 

To obtain the Hamiltonization of the Chaplygin sphere problem by reduction we were forced to intro- 
duce the notion of affine almost Poisson brackets. At this point the presence of the particular affine 
form Qq given by (|6.32p that defines the "correct" bracket remains a mystery. The question remains 
open to give a useful characterization of this form in a more general setting. 

A possible approach is to consider the affine almost symplectic counterpart. In broad lines, this 
approach generalizes the theory of reduction given in |2H [2j [27] by allowing the formulation to be made 
in terms of an affine almost symplectic structure. This approach has the flavor of reduction by stages, 
it sheds some light on the need of the affine term Oo, and is part of the content of [18] . 

Another approach is to consider the reduction of nonholonomic systems using Dirac structures as 
was recently developed in [20]. The presence of the affine term Qq could be related to the theory 
of Poisson geometry with a 3- form background as introduced in [29J. It seems to be a rather strong 
coincidence that the building block to define the form f2o is precisely the Cartan three- form on SO (3). 

We end up by stressing that the key property of the affine Poisson structure that we have considered, 
is that the conserved quantity ifj becomes a Casimir function of the reduced bracket. This was not the 
case with the standard nonholonomic bracket. It is thus natural to ask the following question: Suppose 
that a Lie group H is a symmetry group of a nonholonomic system and that there are conserved 
quantities associated with its action. Suppose in addition that these conserved quantities are invariant 
under the action. Does there exist a (possibly affine) nonholonomic bracket for the system such that 
the conserved quantities are Casimir functions for the corresponding reduced bracket? This issue is 
also treated in [18] . 
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